Hierarchical Landmark Policy Optimization for Visual Indoor Navigation
نویسندگان
چکیده
In this paper, we study the problem of visual indoor navigation to an object that is defined by its semantic category. Recent works have shown significant achievements in end-to-end reinforcement learning approach and modular systems. However, both approaches need a big step forward be robust practically applicable. To solve insufficient exploration scenes make more semantically meaningful, extend standard task formulation give agent easily accessible landmarks form room locations those types. The availability allows build hierarchical policy structure achieve success rate 63% on validation photo-realistic Habitat simulator. hierarchy, low level consists separately trained RL skills high deterministic policy, which decides skill needed at moment. Also, show possibility transferring real robot. After bit training reconstructed scene, robot shows up 79% SPL when solving navigating arbitrary object.
منابع مشابه
A Visual Landmark Framework for Indoor Mobile Robot Navigation
This article presents vision functions needed on a mobile robot to deal with landmark-based navigation in buildings. Landmarks are planar, quadrangular surfaces, which must be distinguished from the background, typically a poster on a wall or a door-plate. In a first step, these landmarks are detected and their positions with respect to a global reference frame are learned; this learning step i...
متن کاملVisual Landmark Selection for Mobile Robot Navigation
A large number of landmarks selection techniques has been proposed. However, finding optimal solutions requires to solve some hard problems. In this paper, we consider the ρminimum overlapping region decomposition problem that was proposed for landmarks selection. This problem is NP-complete. We describe an approach to solve the problem optimally. This approach is based on an explicit reduction...
متن کاملVisual Nouns for Indoor/Outdoor Navigation
We propose a local orientation and navigation framework based on visual features that provide location recognition, context augmentation, and viewer localization information to a human user. Mosaics are used to map local areas to ease user navigation through streets and hallways, by providing a wider field of view (FOV) and the inclusion of more decisive features. Within the mosaics, we extract...
متن کاملState-based SHOSLIF for indoor visual navigation
In this paper, we investigate vision-based navigation using the self-organizing hierarchical optimal subspace learning and inference framework (SHOSLIF) that incorporates states and a visual attention mechanism. With states to keep the history information and regarding the incoming video input as an observation vector, the vision-based navigation is formulated as an observation-driven Markov mo...
متن کاملVisual Landmark Navigation Through Large-scale Environments
Several theoretical models have been proposed for the visual homing behaviour of insects. We take one such model for visual homing, and extend it to an algorithm capable of autonomous exploration and navigation through large-scale environments. The algorithm uses a novel approach to waypoint selection during the construction of multi-leg routes. By locating the boundaries between visual locales...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Access
سال: 2022
ISSN: ['2169-3536']
DOI: https://doi.org/10.1109/access.2022.3182803